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Abstract 

Searches for the Standard Model Higgs boson have been performed in the data 
collected by the DELPHI experiment at LEP in the year 2000 at centre-of- 
mass energies between 200 and 209 GeV corresponding to a total integrated 
luminosity of 224 pb~^. No evidence for a Higgs signal is observed in the kine- 
matically accessible mass range, and a 95% CL lower mass limit of 114.3 GeV/c^ 
is set, to be compared with an expected median limit of 113.5 GeV/c^ for these 
data. 
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1 Introduction 

The LEP accelerator was successfully operated at e"'"e~ collision energies up to 209 GeV 
during the year 2000. The DELPHI experiment has collected more than 224 pb^^ at 
centre-of-mass energies above 200 GeV, extending the range of searches for the Standard 
Model Higgs boson above the previous limits obtained by DELPHI by the other 

LEP collaborations, and by their combination by the LEP Higgs Working Group [Q. 

The results shown in this letter are based on the detector calibration obtained shortly 
after the end of data taking. They will be included in the preliminary combination of the 
LEP collaborations results on the 2000 year data |§, being prepared by the LEP Higgs 
Working group [^. 

1.1 Data and simulation samples 

The data used in this analysis, corresponding to a total of 224.1 pb^^ collected by 
the DELPHI detector in 2000, were analysed in the following subsamples: 2.3 pb~^ at 
an average centre-of-mass energy of 202.6 GeV, 6.7 pb^^ at 203.9 GeV, 10.5 pb^^ at 
204.8 GeV, 62.5 pb'^ at 205.2 GeV, 18.2 pb^^ at 206.2 GeV, 115.2 pb^^ at 206.7 GeV 
and 8.7 pb^^ at 208.2 GeV. 

Monte Carlo samples for background events were produced at fixed centre-of-mass 
energies of 202, 204, 205, 206, 207 and 208 GeV using the same simulation setup as for 
the 1999 analysis [||]. The samples correspond to about 200 times the collected luminosity. 

Similarly, signal events were produced using the HZHA generator, varying the Higgs 
boson mass from 85 GeV/c^ to 120 GeV/c^ in 5 GeV/c^ steps, plus a fine scan in the 
most interesting zone, with samples simulated for mass hypotheses 108, 110, 112, 114 
and 115 GeV/c2. 

1.2 Detector overview 

A detailed description of the DELPHI apparatus can be found in ||^. For the first 
three quarters of the year the detector was operated in nominal conditions. 

Data collected after the 1st of September, corresponding to the last 60 pb~^, were 
affected by the complete failure of one sector (S6) of the TPC detector, which amounts 
to 1/12 of the TPC acceptance. Charged particle tracks crossing this sector were re- 
constructed using the information from the Vertex, Inner and Outer detectors, so the 
effect on the efficiency is limited. A complete sample of background and signal channels 
simulated with this TPC sector off was used to incorporate the small effect on the recon- 
structed event kinematics and the impact on the b-tagging efficiency into the analysis of 
this data sample. 

To follow more precisely the change of conditions during the data taking, the cali- 
bration of the impact parameter resolution was performed with the high energy four-jet 
events. The same procedure was applied to the simulation where the four-jet events were 
selected with the same criteria and appropriately weighted according to the predicted 
cross-sections of the corresponding processes. 

The calibration of the b-tagging used the tracks with negative impact parameter, while 
only the tracks with positive impact parameter were used in the lifetime based b-tagging. 

The number of tracks with negative impact parameter is not affected by the cali- 
bration, and is used as further information in the b-tagging. Therefore this calibration 
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procedure is not correlated with the physics measurement, while it improves significantly 
the agreement between data and simulation. 

The overall performance of the combined b-tagging in hadronic radiative return events 
(e"'"e~— > Z°7), collected during the year 2000, is illustrated in Fig. ^ Effects of possible 
imperfect modelling of the high b-tag tail from non-b quarks were checked using the high 
energy semileptonic W+W" data and are also shown in Fig. 

2 Standard Model Higgs search 

The previous LEP combined limit on the Higgs mass at 95% CL was close to 
108 GeV/c^. Given the integrated luminosity corresponding to the data taken in the 
year 2000, the analysis is expected to cover efficiently the mass range up to the kinemat- 
ical limit allowed by the increase in centre-of-mass energy. 

The following improvements for this high mass range have been introduced in the 
analysis of the two main channels. 

The four-jet analysis benefits from a better tuned b-tagging procedure and although it 
keeps the same event variables in the analysis, the discriminant neural network has been 
optimized for the high mass hypotheses. 

The missing-energy channel includes a tighter preselection and additional variables 
in the likelihood, resulting in a better background rejection for a high mass Higgs; it is 
described in the following section. 



2.1 Hvv channel 

In this channel both the preselection and the final discriminating likelihood have been 
reoptimised in the spirit of a "background free" analysis. A set of stringent cuts was 
applied prior to the construction of the likelihood. 

The discriminating likelihood includes six variables defined after forcing the event into 
a two-jet configuration with the DURHAM [|I^ algorithm: acoplanarity, acollinearity. 



polar angle of the missing momentum with respect to the beam direction, b-tagging, 
invariant mass in the transverse plane, the minimum of the energies around the most 
isolated particle and around the most energetic particle (normalised to their own en- 
ergy). Three more variables are defined leaving the number of jets free in the DURHAM 
algorithm with i/cut = 0.005: the minimum angle between the jet directions and the miss- 
ing momentum in the transverse plane, the minimal jet charged multiplicity, and the 
maximum track or reconstructed lepton transverse momentum with respect to the jet 
axis. 

The effect of the preselection on data and simulated samples is shown in Table ^ After 
a tighter cut to select the most significant candidates, three candidate events remain, 
while 4.9 are expected according to the background simulation. 

Distributions for the most relevant variables in this analysis are shown both at prese- 
lection level (Fig. H), and at the tight selection level (Fig. |). 

The reconstructed Higgs boson mass is defined as the visible mass given by a one- 
constraint fit where the recoil system is assumed to be an on-shell Z° boson. It is used, 
together with the discriminant likelihood, in the two-dimensional computation of the 
confidence levels for the Higgs hypotheses. 
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Selection 


Data 


Background qq{'y) -^^-/ermzon 


Efficiency 


Huu channel 


preselection 


970 


880 467 


390 


67% 


candidates selection 


90 


99.7 50.4 


49.3 


60% 


tight selection 


3 


4.9 1.4 


3.5 


30% 


He"'"e channel 


preselection 


1242 


1172 745 


4I6 


78% 


candidates selection 


7 


11.6 0.5 


10.4 


57% 


tight selection 


1 


3.5 0.1 


3.2 


49% 


H/i"*"/! channel 


preselection 


3780 


3763 2671 


1067 


81% 


candidates selection 


7 


10.6 0.2 


10.4 


67% 


tight selection 


2 


3.6 0.1 


3.5 


56% 


r+r qq channel 


preselection 


9180 


8913 5425 


3468 


98% 


candidates selection 


5 


6.0 0.4 


5.6 


22% 


tight selection 


2 


4.1 0.1 


4.0 


19% 


Hqq channel 


preselection 


2266 


2342 680 


1662 


85% 


candidates selection 


398 


423.7 i5^.5 


268.8 


79% 


tight selection 


8 


7.4 2.8 


4.6 


36% 



Table 1: Effect of the selection cuts on data, simulated background and simulated signal 
events. The two main background contributions are detailed. Efficiencies are given for a 
signal of mn = 114 GeV/c^. Candidates selection indicates the number of events used as 
input to the confidence level calculations. The tight selection is obtained after a further 
cut in the corresponding discriminant variable, and corresponds to the one used in the 
mass plot (Fig. |^). 



2.2 Leptonic channels 

Higgs boson searches in events with jets and leptons follow the analysis applied to 
the 1999 data [|l|, which included a i/i dependence in the corresponding preselections. 
The effect of the selections on data and simulated samples is detailed in Table |1|. Good 
agreement between data and background simulation at the preselection level is observed 
in all the leptonic channels. 

In the He^e~ channel, 7 candidate events are selected in the data, for a total expected 
background of 11.6 events coming mainly from the e^e~qq process. In the H/x'^yU" 
channel, 7 events are selected and 10.6 background events are expected coming mainly 
from the /i"'"/i~qq process. Both channels use the b-tagging value as the discriminant 
variable and the ffited hadronic mass in the two-dimensional calculation of the confidence 
levels. One of the He"'"e~ and two of the H/i+/i~ candidates have a significant b-tagging 
value but are kinematically compatible with the ZZ hypothesis. 

In the r^r~qq channel, 5 candidates are selected, while 6.0 are expected from the 
Standard Model background, which is dominated by the ZZ into r+r^qq process. Two 
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events are selected after a cut on the discriminant likelihood at 0.1; neither has a high 
value for the rescaled mass. 



2.3 Higgs boson searches in four-jet events 

Higgs boson searches in fully hadronic final states start with a common four-jet pre- 
selection which eliminates hard radiative events and reduces the qq(7) and Z7* 
background, forcing all selected events into a four-jet topology with the DURHAM algo- 
rithm. 

The performance of the DELPHI b-tagging procedure in the four-jet analysis was 
specially optimized and enhanced by taking into account the dependence on additional 
variables related to the kinematical properties of b-hadrons produced in decays of the 
Higgs boson. These variables, defined for each jet in the event, are: the polar angle 
of the jet direction, the jet energy, the charged multiplicity of the jet, the angle to the 
nearest jet, the average transverse momentum of charged particles with respect to the 
jet direction, the number of particles with negative impact parameter and the invariant 
mass of the jet. Including this dependence in the tagging algorithm significantly improves 
the rejection of the light quark background. The global b-tagging value of the event is 
defined as the maximum b-tagging value for any di-jet in the event, computed as the sum 
of the corresponding jet b-tagging values. 

The final discriminant variable used in the four-jet channel is defined as the output of 
an artificial neural network (ANN) which combines 13 variables. 

The first variable is the global b-tagging value of the event. 

The next four variables rely on kinematics and test the compatibility of the event with 
the hypotheses of W"'"W~ and ZZ production to either 4 or 5 jets. Constrained fits are 
used to derive the probability density function measuring the compatibility of the event 
kinematics with the production of two objects of any masses. This two-dimensional prob- 
ability, the ideogram probability ||lT| , is then folded with the expected mass distributions 



for the W"'"W~ and ZZ processes, respectively. 

Finally, the last eight input variables intended to reduce the qq(7) contamination are 
the sum of the second and fourth Fox- Wolfram moments, the product of the minimum jet 
energy and the minimum opening angle between any two jets, the maximum and minimum 
jet momenta, the sum of the multiplicities of the two jets with lowest multiplicity, the 
sum of the masses of the two jets with lowest masses, the minimum di-jet mass and the 
minimum sum of the cosines of the opening angles of the two di-jets when considering 
all possible pairings of the jets. In the previous analysis []T| these eight variables were 
separately combined in an anti-QCD artificial neural network. 

Fig. ^ shows the performance of the final discriminating variable in the efficiency- 
background plane for a 114 GeV/c^ signal at ^/s =206.7 GeV. 

The choice of the Higgs di-jet makes use of both the kinematical 5C-fit probabilities 
and the b-tagging information in the event f^. The likelihood pairing function, 
■ ■ ((1 - i?f - i?f ) ■ ■ V^- + R§ ■ VI' ■ VI' + i?f ■ Vi? ■ Vt) ■ P^,, 
is calculated for each of the six possibilities to combine the jets ji,j2, js and J4. 
Vl' ,Vl\V^^ are the probability densities of getting the observed b-tagging value for the 
jet ji when originating from a 6, c or light quark, estimated from simulation. i?f and 
i?f are the hadronic branching fractions of the Z° into 6 or c quarks, and Pj^j^ is the 
probability corresponding to the kinematical 5C-fit with the jets J3 and assigned to 
the The pairing that maximises this function is selected. The proportion of right 
matchings for the Higgs di-jet, estimated in simulated signal events with 114 GeV/c^ 
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mass, is around 53% at preselection level, increasing to above 70% at the tighter level, 
keeping a low rate of wrong pairings for ZZ background events. 

The good agreement between data and background simulation after the four-jet pre- 
selection is illustrated in Fig. |^ which shows the distributions of the global b-tagging, 
the two ideogram probabilities for the configuration with 4 jets, and the output of the 
anti-QCD ANN. The results for the different selection levels are given in Table |T]. The 
tighter cut at an ANN value of 0.7 selects 8 events in data while 7.4 are expected from 
the background simulation. Fig. ^ shows the previous variables at this level. 



2.4 Confidence level estimation 

The confidence levels for the background (CLb) and signal plus background (CLs+b) 
hypotheses are defined as the probability in the two cases of observing a likelihood ratio 



Q, greater than or equal to that measured in the data [12|. The confidence level for 
the signal case is calculated consistently with the LEP Higgs Working group using the 
conservative ratio CLg = CLg+b/CLb. 

The likelihood ratio for a given Higgs mass hypothesis is defined as ln(Q) = —S + 
Sjln(l + Si/bi) where S is the total expected signal, and Si and bi are the signal and 
background probability densities for each candidate i, calculated using two-dimensional 
information, where one dimension is the reconstructed Higgs boson mass and the other 
is the channel dependent discriminant variable. 

These densities are represented as two-dimensional histograms which are derived from 
the simulation samples described in section 1.1. These distributions are then smoothed 
using a two-dimensional kernel, which is essentially Gaussian but with a small longer 
tailed component. The width of the kernel varies from point to point, such that the 
statistical uncertainty on the estimated background is never more than 30%. The same 
width is applied to background and all signal samples to eliminate the possibility of the 
smearing itself increasing the estimated signal to background ratio. Finally the distribu- 
tion is reweighted so that when projected onto either axis it has the same distribution 
as would have been observed if the smoothing had been only in one dimension. This 
makes better use of the simulation statistics if there are features which are essentially 
one dimensional, such as mass peaks, and it has been verified that the systematic errors 
introduced are significantly smaller than the statistical ones. 

The resultant two-dimensional distributions are then linearly interpolated from the 
simulation conditions to the appropriate beam energy and Higgs mass hypotheses. 



3 Results 

The distribution of the reconstructed Higgs boson mass summed over all channels, at 
the level of the tight selection, is presented in Fig. |^. 

The limit on the Standard Model Higgs boson mass is set combining the data analysed 
in the previous sections with those taken at lower energies, namely 161 and 172 GeV 11131 , 
183 GeV II, 189 GeV § and 192-202 GeV ll|]. 

The confidence level for the signal hypothesis CLg is shown in Fig. as well as the 
confidence level for the background hypothesis in the form 1-CLb. A slight deficit with 
respect to the expected background is observed, and a 95% CL lower limit on the mass 
is set at 114.3 GeV/c^ while the expected median is 113.5 GeV/c^ . The test-statistic 
(negative log-likelihood ratio) is shown in Fig. |^. 
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It has been noticed that the combined LEP resuh is better described if a Higgs 
boson with mass 115 GeV/c^ is present. For such a signal, the DELPHI CLg+b value is 
3%, while the CLb is 23%. The CLs for this hypothesis is 12%, so that the present data 
are not incompatible with the existence of a Higgs boson with this mass. This can also be 
seen in Fig. ^ where the result is compared with the probability density for background 
and background plus signal experiments. 

4 Conclusions 

The data taken by DELPHI at 200-209 GeV in the year 2000 have been analysed 
to search for the Standard Model Higgs boson. The data for all channels is compatible 
with expectations from the Standard Model background. In combination with previous 
DELPHI results at lower centre-of-mass energies, a lower limit at 95% CL on the mass 
of the Standard Model Higgs boson is set at 114.3 GeV/c^, while the expected median 
limit is 113.5 GeV/c^. 
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Figure 1: Top: distributions of the combined b-tagging variable, for the year 2000 ra- 
diative return Z7 data (dots) and simulation (histogram). The expected contribution of 
udsc-quarks and non-qq 7 background is shown as the dark histogram. Bottom: same 
distribution for semileptonic W"'"W~ high energy events in the 2000 data. The shaded 
histogram corresponds to the expected contribution from other processes, and shows the 
high purity of the selection. 



9 




50 100 150 50 100 150 



Acoplanarity (deg) 




200 



Figure 2: YLuu channel: distributions of relevant analysis variables, at the preselection 
level. Data at y/s = 200-209 GeV (dots) are compared with Standard Model back- 
ground expectations (left-hand side histograms) and with the expected distribution for a 
114 GeV/c^ Higgs mass signal (right-hand side histogram). 
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Figure 3: Huu channel: same distributions as in FigJ^ but at the tight selection level. 
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Figure 4: Hqq channel: Expected Standard Model background rate at a centre-of-mass 
energy ^/s =206.7 GeV as a function of the efficiency for a 114 GeV/c^ fbggs mass 
signal when varying the cut on the neural network variable. The different background 
contributions are shown summed and separately. Dots stand for data. 
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Figure 5: Hqq channel: distributions of relevant analysis variables at the preselection 
level. The eight variables used to reduce the qq(7) background are summarized by the 
output of the anti-QCD neural network. Data at y/s = 200-209 GeV (dots) are compared 
with Standard Model background expectations (left-hand side histograms) and with the 
expected distribution for a 114 GeV/c^ Higgs mass signal (right-hand side histogram). 
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Figure 6: Hqq channel: same distribution as in Fig.^ but at the tight selection level. 
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Figure 7: Distribution of the reconstructed mass of the candidates when combining all 
HZ analyses at 200-209 GeV in the year 2000. Data (dots) are compared with the Stan- 
dard Model background expectations (light shaded histogram) and with the normalised 
114 GeV/c^ signal spectrum added to the background contributions (dark shaded his- 
togram). 
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Figure 8: Confidence levels as a function of mn . Curves are the observed (solid) and 
expected median (dashed) confidences from background-only experiments while the bands 
correspond to the 68% and 95% confidence intervals from background-only experiments. 
Top: CLg, the confidence level for the signal hypothesis as a function of mn . The 
intersections of the curves with the horizontal line at 5% define the observed and expected 
95% CL lower limits on mn at 114.3 and 113.5 GeV/c^ respectively. Bottom: 1-CLb for 
the background hypothesis. Also shown here is the curve of the median confidence as 
expected for a signal of mass given in abscissa (dotted line). 



16 




-5 



■10 



mH=115.0 



Observed 

Expected background 
Expected signal + background 

I I I I I I I I I I I I I I I I I I I T I I I I 



100 102 104 106 108 110 112 114 116 118 120 

GeY/c') 




5 10 
-2lnQ(115) 

Figure 9: Top: the test-statistic (negative log-likelihood ratio) as a function of mn . 
The observed value, full line, is compared to the expectation for the background only 
hypothesis, represented by the dashed line and the symmetric 68% and 95% probability 
shaded bands. The dot-dashed line shows the average expected result for a hypothetical 
Higgs mass of 115 GeV/c^. Bottom: vertical slice of the previous plot for a mass value 
of 115 GeV/c^, showing the sensitivity of the DELPHI result to this hypothesis. The 
dot-dashed line shows the expected distribution for signal plus background, the dashed 
line that for background only. The vertical line represents the data. The fractional area 
below the dashed curve and to the right of the data is CLb; for the dot-dashed curve it 
is CL(s+b). 



